[12] B. Shi, X. Bai, and C. Yao, “An end-to-end trainable neural network
for image-based sequence recognition and its application to scene text
recognition,” CoRR, vol. abs/1507.05717, 2015. [Online]. Available: http:
//arxiv.org/abs/1507.05717
[13] B. Shi, X. Wang, P. Lv, C. Yao, and X. Bai, “Robust scene text recognition
with automatic rectification,” CoRR, vol. abs/1603.03915, 2016. [Online].
Available: http://arxiv.org/abs/1603.03915
[14] L. Xing, Z. Tian, W. Huang, and M. R. Scott, “Convolutional
character networks,” CoRR, vol. abs/1910.07954, 2019. [Online]. Available:
http://arxiv.org/abs/1910.07954
[15] M. Liao, P. Lyu, M. He, C. Yao, W. Wu, and X. Bai, “Mask
textspotter: An end-to-end trainable neural network for spotting text with
arbitrary shapes,” CoRR, vol. abs/1908.08207, 2019. [Online]. Available:
http://arxiv.org/abs/1908.08207
[16] S. Ren, K. He, R. B. Girshick, and J. Sun, “Faster R-CNN: towards real-time
object detection with region proposal networks,” CoRR, vol. abs/1506.01497,
2015. [Online]. Available: http://arxiv.org/abs/1506.01497
[17] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. E. Reed, C. Fu, and A. C.
Berg, “SSD: single shot multibox detector,” CoRR, vol. abs/1512.02325, 2015.
[Online]. Available: http://arxiv.org/abs/1512.02325
[18] J. Redmon, S. K. Divvala, R. B. Girshick, and A. Farhadi, “You only look
once: Unified, real-time object detection,” CoRR, vol. abs/1506.02640, 2015.
[Online]. Available: http://arxiv.org/abs/1506.02640
[19] D. Deng, H. Liu, X. Li, and D. Cai, “Pixellink: Detecting scene text via
instance segmentation,” CoRR, vol. abs/1801.01315, 2018. [Online]. Available:
http://arxiv.org/abs/1801.01315
[20] S. Long, J. Ruan, W. Zhang, X. He, W. Wu, and C. Yao, “Textsnake: A
flexible representation for detecting text of arbitrary shapes,” CoRR, vol.
abs/1807.01544, 2018. [Online]. Available: http://arxiv.org/abs/1807.01544
[21] Y. Xu, Y. Wang, W. Zhou, Y. Wang, Z. Yang, and X. Bai, “Textfield:
Learning A deep direction field for irregular scene text detection,” CoRR, vol.
abs/1812.01393, 2018. [Online]. Available: http://arxiv.org/abs/1812.01393
[22] K. He, G. Gkioxari, P. Dollár, and R. B. Girshick, “Mask R-CNN,” CoRR, vol.
abs/1703.06870, 2017. [Online]. Available: http://arxiv.org/abs/1703.06870
[23] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for
image recognition,” CoRR, vol. abs/1512.03385, 2015. [Online]. Available:
http://arxiv.org/abs/1512.03385
[24] H. Law and J. Deng, “Cornernet: Detecting objects as paired keypoints,”
CoRR, vol. abs/1808.01244, 2018. [Online]. Available: http://arxiv.org/abs/
1808.01244
97